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ABSTRACT 

The  relation  between  shape  and  function  is  analysed  in  three  basic  types  of 
objects:  boxes,  buttons,  and  rings.  For  each  of  these  types  of  objects  and  their 
subtypes,  we  give  a  geometrical  description,  show  how  their  function  relates  to 
their  shape,  and  enumerate  some  geometric  heuristics  to  help  determine  their  use 
in  a  given  situation. 


Shape  and  Function  of  Solid  Objects:  Some  Examples 


Ernest  Davis 


1.    Introduction 

This  paper  analyses  the  relation  between  shape  and  function  in  a  few  basic  objects  --  various 
kinds  of  boxes,  buttons,  and  rings.  The  basic  ontology  used  in  the  analysis  has  already  been  stu- 
died in  [1];  it  gives  a  formal  semantics  for  such  concepts  as  "object",  "action",  "physical  law",  etc. 
By  analysing  these  particular  objects  in  detail,  we  wish,  firstly,  to  test  the  adequacy  of  this  ontol- 
ogy in  reasoning  about  shape  and  action,  and,  secondly,  to  determine  what  geometric  predicates 
are  needed  in  this  domain. 

Our  analysis  of  each  type  of  object  has  three  parts.  First,  we  present  a  general  definition  of 
the  object  in  terms  of  its  shape.  Se  ond,  we  enumerate  some  important  facts  about  the  functioning 
of  the  object,  and  express  their,  in  terms  of  our  ontology.  This  expression  is  sometimes  in 
English,  sometimes  in  predicate  calculus,  and  sometimes  in  both,  whichever  is  clearest.  Third,  we 
define  some  of  the  geometric  predicates  which  are  useful  in  determining  whether  an  object  can 
serve  a  given  function  in  a  given  situation,  and  we  give  a  few  simple  heuristics  for  evaluating  these 
predicates  in  simple  cases. 

In  the  intuitive  understanding  of  these  types  of  objects,  there  is  generally  a  continuous  range 
between  example  and  non-examples,  with  a  broad  range  of  dubious  cases  in  between.  Any  formal 
definition  must  therefore  be  somewhat  arbitrary  in  drawing  lines.  We  have  chosen,  as  a  rule,  to 
draw  lines  as  broadly  as  possible,  and  we  leave  it  up  to  the  processes  which  employ  this  theory  not 
to  use  descriptions  which,  though  technically  accurate,  are  inappropriate.  Another  technique  to 
deal  with  these  ambiguities  is  to  define  some  shape  predicates  as  binary  relations  on  objects,  rather 


than  simple  object  properties.  For  example,  we  define  the  concept  of  "cage"  as  a  relation  between 
the  cage  and  its  potential  contents,  rather  than  merely  a  shape  description  of  the  cage.  In  this  way, 
we  can  more  satisfactorily  deal  with  various  "near"  cages;  we  just  state  that  they  are  cages  for 
some  kinds  of  contents,  and  not  for  others. 

This  paper  is  full  of  formulas:  geometric  definitions  and  functional  facts,  which  are  essen- 
tially geometric  theorems.  We  certainly  do  not  envision  a  program  which  will  prove  the  theorems 
from  the  definitions.  Nor,  on  the  other  hand,  are  the  theorems  of  tremendous  interest  as  pieces 
of  pure  mathematics.  The  purpose  of  these  formulas  is  simply  to  express  pieces  --  hopefully, 
representative  pieces  •-  of  commonsense  knowledge  about  commonplace  objects.  The  fact  that 
they  are  theorems  is  significant  because  it  assures  us  that  we  are  not  demanding  more  of  our 
theory  than  we  put  into  it;  the  proof  process  is  not  here  of  interest.  Most  of  the  proofs  of  the 
theorems  are  quite  straightforward;  where  they  are  not,  I  have  indicated  them  briefly. 

2.    Basic  Representation 

This  section  presents  the  definitions  of  the  basic  terms  used  in  our  theory.  Most  of  these 
terms  have  been  already  discussed  ui  [1];  their  discussion  here  will  therefore  be  quite  concise.  A 
few  terms  will  be  given  a  new  def  iiition  and  some  new  terms  will  be  introduced.  Particularly  signi- 
ficant in  the  previous  paper  are  sections  3,  4,  8,  and  9.  I  assume  some  elementary  topology. 

The  definition  of  body  given  in  [1]  is  slightly  modified.  Previously,  we  defined  a  body  as  any 
closed,  bounded  subset  of  R3.  We  wish  to  strengthen  this  by  requiring  further  that  a  body  be  con- 
nected, and  that  it  be  equal  to  the  closure  of  its  interior.  Thus  we  rule  out  two-  or  one- 
dimensional  objects  or  pieces  of  objects.  (Note  the  contrast  with  Hayes  [2],  who  uses  these  exten- 
sively.) This  restriction  is  useful  in  a  number  of  contexts;  for  example,  it  allows  us  to  state  that 
two  objects  on  opposite  sides  of  a  wall  are  not  adjacent. 

As  in  [1],  we  assume  the  existence  of  a  number  of  distiguishable  objects.  The  shape  of  an 
object  O  is  defined  in  terms  of  a  point  set  in  some  standard  position,  denoted  "stand(O)".  Other 
positions  of  the  body  are  defined  in  terms  of  mappings,  which  are  rigid  Euclidean  transformations. 


(Our  theory  treats  only  hard,  rigid  objects.)  Given  a  mapping  M  and  a  point  set  P,  we  write 
image(A/,/>)=!M(F),  the  image  of  P  under  M.  A  scene  is  a  function  from  a  set  of  objects  to  map- 
pings, thus  defining  the  positions  of  each  of  the  objects.  Given  a  scene  5,  "objects(5)"  is  the 
objects  in  S,  that  is,  the  domain  of  S.  Given  a  scene  S  and  an  object  O, 
scenemapping(5,0)=5(0),  the  mapping  which  locates  O  in  S.  The  point  set  occupied  by  0  in  S  is 
designated  "place(0,S)"  and  may  be  defined  as  image(scenemapping(S,0),stand(0)).  We  define  a 
new  predicate  on  two  scenes  "subscene(51^S2)"  t0  mean  t*iat  St  is  a  restriction  of  S2  to  a  subclass 
of  objects. 

We  frequently  wish  to  consider  other  point  sets  as  moving  along  with  an  object;  for  example, 
the  hole  of  a  doughnut  moves  with  the  doughnut.  In  [1]  we  used  the  notation 
"sceneplace(/>,0,S)"  to  indicate  the  position  of  a  point  set  P  associated  with  an  object  O  in  a  scene 
S  containing  0.  Thus,  'sccneplace(HOLEJ)OUGHNUTJS)"  would  be  the  position  of  the  hole  of 
the  doughnut  in  scene  5.  It  turns  out  to  be  convenient  to  use  the  same  notation  for  these  point 
sets  as  for  true  objects.  We  therefore  proceed  as  follows.  We  define  P  to  be  a  "pseudo-object"; 
analogous  to  an  object,  except  for  three  differences:  its  shape  can  be  an  arbitrary  point  set,  its 
position  is  dependent  on  some  re  d  object,  and  it  has  no  physical  reality,  and  is  therefore  not 
directly  affected  by  physical  laws.  We  associate  two  functions  with  pseudo-objects:  "source_of(/')", 
the  real  object  with  which  P  is  associated;  and  "stand(F)",  which  is  the  point  set  which  P  occupies 
when  0  is  in  standard  position.  The  function  "source_of '  is  also  defined  on  real  objects,  and  is  the 
object  itself:  source_of(0)=0.  Now  we  can  redefine  "place"  as  a  two-place  function  which  applies 
equally  to  objects  and  pseudo-objects: 

place(g,5)  ■  irnage(scenemapping(source_of(C)yS),stand(C)). 
That    is,    stand(Q)    and    stand(souce_of(C))    define    a    standard    arrangement    for    the 
pseudo_object  Q  and  its  associated  real  object.   In  all  other  scenes,  the  point  set  of  Q  is  deter- 
mined by  the  requirements  that  both  its  shape  and  its  position  relative  to  the  position  of 
source_of(2)  in  the  scene  remain  unchanged. 

We  also  modify  a  convention  established  in  [1]  by  which,  for  a  geometric  predicate  g,  and 


objects  O.,  02,  ...  "g(Ox,02  •  •  •  S)"  was  shorthand  for  "g(place(0:,S)1  place(C»:)5)  ...)"•  It  is 
both  more  readable  and  more  consonant  with  previous  work  (for  example,  [3]  and  [4])  to  write 
this  after  the  fashion  of  modal  logic  as  "true  (S,g(Ox,02,  •  •  • ))  (read  "g{0-,02,. ■•)  is  true  in  5"). 
No  confusion  arises  as  long  as  it  is  kept  in  mind  that  this  is  merely  an  abbreviation  for  the  longer 
form.  Likewise,  a  geometric  function  defined  on  bodies  may  be  applied  to  objects  within  the  scope 
of  a  relativization  to  a  scene  5;  it  then  is  interpreted  as  applying  to  their  places  in  S.  Thus,  for 
example,  "true  (S,abuts(0.lJ0:,03))"  ^  an  abbreviation  for 

"abuts(place(0.  ,S)  U  pla^(O:>S),place<03,5)^ 
Neither  the  predicate  "abuts"  nor  the  function  "union"  can  strictly  speaking  be  applied  to  objects; 

they  are  therefore  interpreted  as  applying  to  the  positions  of  the  objects  in  S. 

The  standard  scene  "stand_scene"  is  defined  such  that  every  object  is  in  its  standard  position. 
Thus,  instead  of  "stand(O)"  we  can  write  "place(0,  stand_scene)". 

The  definitions  of  the  basic  terms  associated  with  histories  remain  the  same  as  in  [1].  A  his- 
tory is  a  continuous  function  which  maps  time  to  scenes.  The  range  of  a  history  is  a  single  interval, 
which  must  be  closed  on  the  left,  ?nd  may  be  either  closed  or  infinite  on  the  right.  Given  a  his- 
tory H,  "scenes(//)"  denotes  the  se'  of  all  scenes  occuring  in  H.  "Startscene(//)"  and  "endscene(//)" 
are  respectively  the  first  and  last  scene  in  H.  We  write  "subhistory(// -yH-J"  to  mean  that  H-  is  a 
restriction  of  H2  to  a  subinterval.  We  introduce  a  new  notation  "object_subhistory(//- ,//;)"  to 
mean  that  H-  is  restriction  of  H2  to  a  subclass  of  objects. 

The  motion  of  an  object  0  in  a  history  H  is  the  restriction  of  H  to  O.  The  net  motion  of  0  in 
H  is  the  mapping  which  takes  its  starting  position  into  its  ending  position.  O  moves  in  H  if  it  has 
different  positions  in  different  scenes  of  H. 

We  use  a  number  of  basic  geometric  predicates  on  bodies.  Two  bodies  are  said  to  overlap  if 
their  interiors  intersect.  Given  a  set  of  bodies  BB,  no  two  of  which  overlap,  we  write 
"non_overlap(5fl)".  Two  bodies  abut  if  they  do  not  overlap,  but  their  boundaries  intersect.  Two 
bodies  are  disjoint  if  they  do  not  intersect  at  all.  A  body  is  a  simple  solid  if  it  is  homeomorphic  to 
a  solid  sphere;  that  is  it  has  no  holes  at  all,  either  inside  it  or  through  it. 
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We  also  need  a  few  geometric  terms  applying  to  point  sets  either  than  bodies.  We  will  use 
the  concept  of  a  points  set  P  being  homeomorphic  to  a  two-dimensional  band;  like,  for  example, 
the  region  between  two  concentric  circles.  An  arc  is  a  finite  curve  containing  its  endpoints  which 
does  not  intersect  itself  and  has  distinct  endpoints.  A  face  is  a  two-dimensional  surface 
homeomorphic  to  a  simple  disc;  for  example,  a  solid  circle,  including  its  interior.  A  directed  face  is 
a  face  with  a  distinguished  orientation. 

We  express  natural  laws  in  terms  of  constraints,  which  are  the  sets  of  histories  allowable 
under  the  laws.  We  will  use  in  this  paper  most  of  the  constraints  defined  in  [1].  The  "hardobjs" 
constraint  is  the  set  of  histories  in  which  no  two  objects  ever  overlap.  Together  with  this,  we  also 
define  the  set  of  scenes  "hard_obj_scenes",  as  the  set  of  scenes  in  which  no  two  objects  overlap. 
The  "push_driven"  constraint  requires  that  no  object  move  in  the  history  unless  some  animate 
object  was  pushing  it,  directly  or  indirectly.  ("Animate"  is  an  undefined  qualitative  term,  in  terms 
of  our  theory.)  The  "stationary_ground"  constraint  asserts  that  the  "ground",  a  distinguished 
object,  does  not  move.  The  "must_fall"  constraint  asserts  that  an  unsupported  object  other  than 
the  ground  must  move  downward.  ("Up"  is  a  distingished  direction  in  our  theory.)  The  "no_up" 
constraint  asserts  that  no  object  moves  upward  unless  it  is  driven  by  a  sequence  of  pushings  and 
carryings  starting  either  with  an  <  nimate  object  or  with  a  falling  object.  Finally  the  "gravity"  con- 
straint is  the  conjuction  of  the  "hardobjs",  "must_fall",  "stationary_ground",  and  "no_up"  con- 
straints. (This  does  not  give  a  full  theory  of  gravity.)  Full  formal  definitions  are  given  in  section  9 
of  [1]. 

3.   Boxes 

3.1.  Closed  Boxes 

A  box  B  is  a  body  whose  complement  has  at  least  two  connected  components.  The  infinite 
component  is  designated  "outside(5)".  The  set  of  finite  components  is  designated  "insides(5)".  It 
is  a  fact  that  when  boxes  are  moved  from  one  scene  to  another,  they  remain  boxes,  and  the  inside 
and  outside  move  with  them. 


for  all  (0  ,/>,£?) 
[  0  =  source_of(P)  =  source_of(G)  and 

true  (stand_scene,  box(0)  and  P=insides(0)  and  Q=outside(0)  ]  implies 

for  all  (S)  [  Ocobjects(S)  implies  true  (5,  box(0)  and  /»=insides(0)  and  C  =  outside(0))] 

By  virtue  of  the  above  fact,  we  may  write  "box  (0)"  of  an  object  0  independent  of  the 

scene.    We  define  "inside(5)"  as  the  union  of  all  the  insides  of  B.  Given  an  body  0  and  a  box  B 

we     define     "isinside(0,fl)"     iff     subset(0,inside(fl))     and     similarly     "isoutside(0,fl)"     iff 

subset(0,outside(fl)). 

The  following  facts  about  boxes  under  hard  objects  conditions  are  important.  (Al)  and 
(A  2)  are  geometrical,  relating  to  scenes;  (A. 3)  and  (A. 4)  are  dynamic,  relating  to  histories;  and 
(A. 5),  (A. 6),  and  (A-7)  have  functional  implications. 

Al)  In  any  scene,  all  other  objects  are  either  inside  or  outside  a  box. 

A  2)  No  object  inside  a  box  can  abut  an  object  outside  the  box. 

A3)  In  any  history,  any  other  object  is  either  always  inside  or  always  outside  a  box. 

A 4)  Motions  of  objects  inside  th    box  cannot  affect  those  of  objects  outside  the  box,  except 

through  the  box;  and  vice  versa. 
A5)  Objects  inside  the  box  are  protected  from  those  outside  the  box.   and  vice  versa. 
A 6)  An  object  inside  the  box  can  be  moved  by  pushing  the  box. 
AT)  If  an  empty  box  can  be  moved  in  a  particular  way  around  obstacles  then  it  can  be  moved  in 

the  same  way  with  objects  inside. 

Many  other  facts  could  be  enumerated  --  for  example,  two  objects  inside  a  box  cannot  get 
more  than  a  certain  distance  apart  --  but  these  seem  like  a  good  start.  We  will  now  develop  these 
facts  formally  in  sequence. 

Al:  In  any  scene,  all  other  objects  are  either  inside  or  outside  a  box.   Formally, 
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for  all  (S£hard_obj_scenes,Otfl) 
[  box(B)  andfl^O  andfl,0eobjects(S)]  implies  true  (S,  isinside(0,fl)  or  isoutside(0,fl)) 
A.2:  No  object  inside  a  box  can  abut  an  object  outside  the  box.  Formally, 

for  all  (5^,2)  I*  Bf,Q  are  bodies  rather  than  objects  •/ 

not  [box(5)  and  isinside(P,fl)  and  isoutside(Q3)  and  abut(P,Q)  ] 
A. 3:  In  any  history,  any  other  object  is  either  always  inside  or  always  outside  a  box.  For- 
mally, - 

for  all  (//ehardobjs;  fl,0tobjects(//)) 

exactly_one 

[  [for  all  (Sescenes(tf))  true  (5,  isinside(fl,0))], 

[for  all  (Sescenes(#))  true  (5,  isoutside(fl,0))], 

[B=0]  ] 
("exactly_one  (A,B,C)"  is  a  boolean  function,  meaning  that  exactly  one  of  the  arguments  holds.) 

In  consequence  of  this  result,  we  will  write  "isinside(0 ,£,#)"  or  "isoutside(0^^/)"  to  mean 
that  0  is  inside  or  outside  B  throughout  the  history  H. 

A. 4:  Motions  of  objects  insicli  the  box  cannot  affect  those  of  objects  outside  the  box,  except 
through  the  box;  and  vice  versa.  Formally,  given  two  histories  satisfying  the  hard  objects  con- 
straint with  the  same  motion  of  the  box,  such  that  one  does  not  involve  objects  inside,  and  the 
other  does  not  involve  objects  outside,  the  two  may  be  spliced  together. 

for  all  (/^ehardobjs,  //;ehardobjs,  B) 
[  for  all  (O^objects^))  not  (isinside(0ltfl,//))  and 
for  all  (02eobjects(//2))  not  (isoutside(02,fl,//))  and 
{B}  =  objects(//. )  n  objects^,)  and 
mouon(5^:)  =  motion(fl,#2)  and  box(fl)  ]  implies 
exists  (Z/'thardobjs)  [object_subhistory(//1^')  and  object_subhistory(#;,//' )] 
An  similar  result,  more  functionally  suggestive,  is  that,  given  the  "push_driven"  constraint,  a 
robot  standing  outside  the  box  cannot  move  an  object  inside  the  box  without  moving  the  box 
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itself.  (It  is  also  true,  of  course,  that  objects  outside  the  box  do  not  move  unless  there  is  a  robot 
outside  the  box,  but  that  is  less  important.  The  reason  that  you  cannot  push  a  box  from  inside 
very  effectively  is  beyond  the  range  of  our  simple  dynamics.)  Formally, 

for  all  (//€push_driven;fl,Oeobjects(//)) 
[  isinside(0,2?,//)  and  movesin(0 ,//)  ]  implies 
[  exists  (/?eobjects(//))  animate(/?)  and  isinside(J?  ,£>,//)  ]  or  movesin(£ ,//)  ] 
A.5:  Objects  inside  the  box  are  protected  from  those  outside  the  box,  and  vice  versa.  To 
state  this  formally,  we  first  define  "protected  (Ol,02£JiH)"  (object  0-  is  protected  from  object 
02  throughout  histories  in  HH  starting  with  S)  as  follows:  no  history  in  HH  starting  with  S  causes 
O-  and  02  to  come  into  contact.  (Note  that  the  definition  is  symmetric  in  O-  and  0-.)  Formally 

protected(0.  ,0,  ,$,//#)  iff 
forall(//tW/)  [subscene(S,  startscene(//))  implies 

forall  (S'escenes(#))  [true  (S',disjoint(0., <?;))]] 
We  can  now  state  the  desired  result  about  boxes: 

foraU(fl,0.,0;,S) 

[  isinside(0.,fl,5)  and  is'.utside(02,fl,5)]  implies  protected(0.,0:,5,hardobjs) 
A.6:  An  object  inside  the  box  can  be  moved  by  pushing  the  box.   It  is  not,  in  general  possi- 
ble to  move  the  object  to  any  specified  position  by  pushing  on  the  box,  but  it  is  possible  using  first 
a  push  of  the  box,  and  then  a  motion  of  the  object  within  the  box.  Formally, 

/•  S  is  the  starting  scene;  M2  shows  the  ending  position  of  the  box;  R  is  a  robot  •/ 
for  all  (O.fl.S./f.A/J 
[  true  (S,  isinside(O^)  and  isoutside(/?,fl))  and  animate^ )  ]  implies 
exists  (//•thardobjsnpushdrim1.  //2thardobjs) 
[  objects(//)= objects^J  =  {0  ft  M}  and 
startscene(//)=5  and  scenemapping(0 ,  endscene^J)  =  A/-  and 
endscene(//. )  =  startscene(//-)  and  not  (movesin(B ,//-))  ] 


A.  7:  If  an  empty  box  can  be  moved  in  a  particular  way  around  obstacles  then  it  can  be 
moved  in  the  same  way  with  objects  inside.  In  our  ontology,  an  obstacle  is  a  constraint  that  some 
particular  object  or  objects  never  move.  The  above  principle,  however,  turns  out  to  be  false  if  the 
obstacles  are  inside  the  box;  in  that  case,  they  can  interact  with  the  content  so  as  to  obstruct  the 
box.  The  idea  of  an  obstacle  inside  a  closed  box  seems  implausible  because  real  fixed  obstacles  are 
generally  directly  or  indirectly  attached  to  the  ground  or  some  other  large  feature  of  the  environ- 
ment; this  assumed,  our  principle  follows.  We  proceed  as  follows  to  formalize  all  this. 

We  note,  first,  that  the  ground  is  infinite  while  the  inside  of  a  box  is  finite.  (These  are 
axioms.)  It  follows,  therefore,  that  in  a  hard  objects  scene,  the  ground  is  outside  the  box.  Using 
fact  A.2,  it  follows  that  anything  that  abuts  the  ground  •-  using  induction,  any  thing  which  is  in  a 
chain  of  abutments  to  the  ground,  where  the  chain  does  not  include  the  box  itself  --  is  likewise 
outside  the  box. 

Now  we  can  define  obstacles  attached  to  the  ground.  Firstly,  we  define  "directly  fixed"  and 
"mutually  fixed"  as  functions  which  define  constraints.  A  is  directly  fixed  to  B  in  histories  where 
they  abut  and  where  their  relative  position  does  not  change.  A  set  of  objects  00  is  mutually  fixed 
if  there  is  a  chain  of  fixings  in  OO  'Xmnecting  any  two  elements  of  00. 

direct_fixed(A,fl)  «» 

{  H  |  forall  (Sfcscenes(j/)) 

true(5,abut(A3))  and 

netmotion(A,startscene(#),S)  =  netmotion(fl,startscene(#),S)  } 
mutually_fixed(C>0)  » 
{//|forall(A,fle00) 

exists  (AX=A^  '  '  '  An=B) 
Ate.00  for  i  =  1  ton  and  #€direct_iixed(i4l,A1+1)  for  i=l  to  n-1  } 
Given  scene  S  and  object  set  OO  containing  the  ground,  we  now  define  "obstacle(00,5)"  as 
the  constraint  where  all  the  objects  are  permanently  fixed  to  the  ground  in  the  positions  described 
by  S.  If  there  are  objects  in  OO  which  do  not  abut  the  ground  in  5,  directly  or  indirectly,  then 
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"obstacle(00,S)"  is  the  null  set. 

If  groundeOO  and  subset(00,objects(5))  then 
obstacle(00,5)  - 
hardobjs  f)  stationary _ground  f)  mutually_fixed(00) 

n  {  H  |  subscene(S,  startscene(//))  } 
It  follows  that  all  the  objects  in  00  occupy  the  same  position  in  all  the  scenes  of  H.  For- 
mally, for  all  (#6obstacle(00,S)),  5'€scenes(//))  subscene(S,S') 

We  can  now  state  the  desired  result:  putting  objects  inside  a  box  does  not  affect  whether  it 
can  be  moved  around  obstacles.  We  express  this  as  follows:  Given  a  history  H  of  moving  the  box 
around  with  objects  inside,  and  a  history  H'  of  the  same  motions  of  the  box  without  the  objects,  if 
H'  avoids  the  obstacles  then  so  does  H. 

for  all  (fl, 00 ,5) 
for  all  (f/'eobstacle(00,S),  tfehardobjs) 

[  box  (B)  and  Be  objects(tf')  and  object_subhistory(//'  Ji)  and 

for  all  (Oe[objects(#;  -  objects(//')])  isinside(0^^)  ] 
implies  #eobstacles('  OJS) 
In  applying  the  theory  of  boxes  to  particular  objects,  the  most  important  predicate  is  that  of 

an  object  fitting  inside  a  cavity.  Object  0  fits  inside  an  interior  space  /  of  box  B  if  there  is  any 
scene  S  such  that  0  is  a  subset  of  /  in  S.  A  naive  reasoner  would  be  supplied  with  a  number  of 
simple  heuristics  for  evaluating  whether  0  fits  inside  /,  such  as  "If  0  and  /  are  spherical,  then  0 
fits  inside  /  iff  the  radius  of  0  is  less  than  that  of  /";  If  0  fits  inside  /  and  /  fits  inside  J  then  I 
fits  inside  J";  "If  the  maximal  diameter  of  0  is  greater  than  the  maximal  diameter  of  /,  then  O 
does  not  fit  inside  /";  etc. 

3.2.   Open  Boxes 

Open  boxes  are  substantially  less  clear  cut  than  closed  boxes.  Open  boxes  can  be 
transformed  smoothly  into  random  objects  without  ever  crossing  any  clear  dividing  point.  Imagine, 
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for  example,  starting  with  an  ordinary  box  with  an  opening  on  top,  and  gradually  lowering  the 
walls;  or  opening  a  crack  in  one  of  the  walls,  starting  at  the  open  top,  working  down  to  the  floor, 
and  then  gradually  getting  wider  and  wider  until  the  floor  has  disappeared;  or  making  a  bump  in 
the  middle  of  the  floor  and  gradually  raising  it  until  it  towers  high  above  the  walls,  which  sur- 
round it  like  the  brim  of  a  hat. 

To  avoid  drawing  aritrary  cutoff  points,  we  define  "open  box"  as  a.predicate  applying  to  two 
bodies:  the  container  and  the  opening.  We  define  a  object  C  and  a  opening  P  to  form  an  open  box 
if  the  following  conditions  are  met:  C\JP  is  a  closed  box;  P  is  a  simple  solid;  P  abuts  but  does 
not  overlap  C;  P  borders  both  the  outside  and  the  inside  of  C\^JP;  and  both  of  these  borders 
consist  of  a  single  connected  face. 

Straightforward  examples  would  be  where  C  is  an  ordinary  open  box  and  P  covers  the  open- 
ing. There  is  great  leeway  in  choosing  P  as  long  as  the  topological  relations  with  C  are  satisfied.  P 
may  be  chosen  to  sit  just  inside  the  top  of  the  box,  or  just  over  the  top  of  the  box,  as  it  seals  the 
box,  and  is  a  legitimate  3-dimensional  object,  not  a  two-dimensional  surface.  It  may  make  a  great 
bubble  over  the  box,  or  it  may  be  placed  well  inside  the  box,  and  block  off  only  a  small  corner. 
All  of  these  are  equally  valid  inst.  nces  of  the  definition;  in  different  circumstances,  some  will  be 
more  useful  than  others.  As  the  box  is  transformed,  in  the  ways  discussed  above,  the  opening  may 
be  transformed  with  it;  again,  the  definition  remains  satisfied,  though  the  usefulness  or  appropri- 
ateness of  the  description  may  diminish.  Other  kinds  of  perverse  examples  are  possible.  For  exam- 
ple, any  body  is  an  open  box  relative  to  an  opening  which  sits  like  a  hollow  dome  on  an  unbroken 
part  of  its  outer  surface. 

The  importance  of  each  condition  of  the  definition  is  best  illustrated  by  counter-examples.  A 
ring  is  not  an  open  box  with  respect  to  its  hole,  because  their  union  is  not  a  closed  box.  A  closed 
box  with  a  ring  on  the  outside  (like  a  suitcase  with  a  handle)  is  not  an  open  box  with  respect  to 
the  hole  in  the  ring,  because  the  opening  does  not  border  the  inside  of  their  union.  A  pair  of 
boxes  joined  by  a  tunnel  is  not  an  open  box  with  respect  to  the  tunnel,  because  the  tunnel  does 
not  border  the  outside.  A  yo-yo  is  not  an  open  box  with  respect  to  a  strip  running  around  the 
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outside,  because  the  strip  is  a  torus,  not  a  simple  object.  A  box  with  two  openings  is  not  an  open 
box  with  respect  to  any  body  which  plugs  both  openings.  If  the  two  plugs  are  connected  on  the 
outside  of  the  box,  then  the  border  of  the  body  with  the  inside  consists  of  two  separate  faces.  If 
the  two  plugs  are  connected  on  the  inside,  the  the  border  on  the  outside  is  two  separate  faces.  If 
they  are  connected  on  both,  then  the  body  is  not  a  simple  object. 

An  alternative  definition  of  an  open  box  would  be  in  terms  of  an  object  0  and  a  cavity  C 
gouged  out  of  O.  <C,0>  is  an  open  box  if  C  abuts  O,  C  does  not  border  any  inside  of  C\^JO, 
and  C  borders  the  outside  of  C(JO  in  a  single  simply  connected  face.  We  can  identify  C  here 
with  the  union  of  the  opening  and  the  inside  in  the  first  defintion;  in  that  case,  the  two  definitions 
are  nearly  equivalent,  though  differing  in  a  few  marginal  cases.  We  have  adopted  the  first  defini- 
tion, despite  its  greater  complexity,  for  two  reasons.  Firstly,  it  emphasizes  the  connection  between 
open  boxes  and  closed  boxes.  Secondly,  there  are  many  important  cases  where  it  is  natural  to  con- 
sider the  opening  separately  from  the  interior;  for  example,  thin  necked  bottles. 

We  may  now  state  a  number  of  important  facts  about  open  boxes.  Some  of  these  are 
analogues  of  the  facts  we  have  stated  about  closed  boxes.  In  accordance  with  the  convention 
above,  we  will  write  "open_box(5.  ')"  if  5  is  an  object,  P  is  a  pseudo_object  whose  souce  is  B  and 
they  form  an  geometrical  open  bcx  in  any  scene. 

B.l)  Any  object  which  intersects  both  the  inside  and  outside  of  an  open  box  also  intersects  the 
opening.  (Analogous  to  Al). 

B.2)  Any  object  which  goes  from  outside  an  open  box  to  inside  an  open  box  goes  through  the 
opening.  (Analogous  to  A3) 

B.3)  An  object  inside  an  open  box  which  opens  on  top  will  not  fall  out  of  the  box. 

B.4)  Under  gravity,  an  object  inside  a  box  which  opens  on  top  will  not  come  out  of  the  box  if  the 
box  is  moved  without  tilting.   (Weakly  analogous  to  A6). 

The  formal  statements  are  as  follows: 

B.l:  Any  object  which  intersects  both  the  inside  and  outside  of  an  open  box  also  intersects 


-13- 
the  opening. 

for  all  (fl,0,P) 
[  opeh_box(fl  J1)  and  non_overlap(5,0)  and 
intersect(0,inside(5UP)  and  intersect(0,outside(5lJ/>))] 

implies  intersect^,/*) 
B.2:  Any  object  which  goes  from  outside  an  open  box  to  inside  an  open  box  goes  through 
the  opening.  This  can  be  formulated  either  in  weak  or  strong  form,  depending  on  our  definition  of 
"goes  through".  We  can  define  it  simply  to  mean  that  at  some  instant  the  object  intersects  the 
opening.  In  that  case  we  can  formalize  the  fact  simply  as  follows. 

for  all  (//ehardobjs;2?,0eobjects(//);i>) 
[  open_box(5^')  and 

true(startscene(//),isoutside(0,fllJP))  ^  true(endscene(#),isinside(C>,flU/'))  ] 
implies  exists  (Sescenes(/0)  true(S,intersect(0,/')) 
A  more  detailed  analysis  of  an  object  "going  through"  a  space  gives  us  a  stronger,  and  some- 
times more  useful,  result.  We  mus*  define  some  new  geometrical  terms.  We  begin  with  the  sim- 
plest case:  a  point  going  through  a  face.  Given  a  directed  face  F  in  /?3  --  that  is,  a  directed  surface 
homeomorphic  to  a  disk  --  and  a  continuous  function  p  from  a  closed  interval  I=[L,U]  to  /?3 
representing  the  motion  of  a  point  over  time,  we  define  p  as  going  into  F  if  F  may  be  extended  to 
an  infinite  directed  surface  F' ,  homeomorphic  to  a  plane,  such  that 

1)  The  complement  of  F'  consists  of  two  connected  components,  the  positive  space  F'  +  and 
the  negative  space  F'  — ; 

2)  p(L)iF'  +  ;p(U)*F'-;and 

3)  for  Td,  if  p(T)*F'  then  p{T)iF. 

(See  figure  1).  The  last  condition  states  that  p  stays  clear  of  the  rest  of  F'  outside  of  F.  We 
define  p  going  out  of  F  analogously,  merely  reversing  +  and  -  in  (2).  Given  pseudo-objects  p,  a 
point,  and  F,  a  directed  face,  and  a  history  H,  we  state  that  p  goes  into  F  in  H  if  conditions  (1)  to 
(3)  above  hold,  substituting  "p\ace(pJi(T))"  for  "p(T)'\  "place  (FJi(T))"  for  "F",  etc. 
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We  may  next  define  an  object  going  through  a  space.  Given  an  object  0,  a  pseudo-object  Q 
which  is  a  simple  solid,  and  two  faces  F.  and  F2  of  Q  directed  outward,  we  say  that  0  goes 
through  Q  from  F.  to  F2  in  history  H  if,  for  every  point  p  in  0, 

1)  p  goes  into  F,  in  H; 

2)  p  goes  out  of  F2  in  //; 

3)  p  is  disjoint  from  C  at  both  startscene(//)  and  endscene(//);  and 

4)  Throughout  H,  p  is  disjoint  from  any  part  of  the  boundary  of  Q  other  than  F  and  F2. 

In  the  case  of  an  open  box,  the  boundary  opening  is  divided  by  the  box  into  the  part  where 
the  boundary  meets  the  box,  the  outer  face,  and  the  inner  face.  Fact  B.2  above,  states  that  if  an 
object  goes  from  outside  to  inside  a  open  box,  it  goes  through  the  opening,  from  the  outer  face  to 
the  inner  face. 

B.3:  An  object  inside  an  open  box  which  opens  on  top  will  not  fall  out  of  the  box.  Specifi- 
cally, if  a  box  does  not  move  in  a  given  history,  and  the  opening  of  the  box  is  entirely  above  the 
inside  of  the  box,  and  no  other  object  is  ever  inside  the  box,  then  an  object  which  starts  inside  the 
box  must  stay  inside  the  box.   This  follows  directly  from  B.2.  Formally 

forall  (tfehardobjsDniusi.fall;  fl,0tobjects(//); P) 
[  open_box(B  J1)  and  not  (movesin(5^/))  and 
true  (startscene(//) , 

[  isJnside(Oft\JP))  and  for  all  (ptP,  ^tinside(OU^)))  p-teq-z  ])  and 
for  all  (0'cobjects(//)-{B,0},  S€scenes(#))  true(S,is_outside(<7rflU/>))  ] 
implies  is_inside(<9  Ji  \^JP  ,endscene(//)) 
B.4:  Under  gravity,  an  object  inside  a  box  which  opens  on  top  will  not  come  out  of  the  box 
if  the  box  is  moved  without  tilting.    This,  as  it  happens,  is  not  quite  true  under  the  definition 
given  for  gravity  in  [1].  The  first  is  that  we  can  move  the  box  down  faster  than  the  object  can  fall. 
The  other  is,  if  the  walls  of  the  box  slant  outward,  we  can  use  them  as  a  wedge  to  push  the  object 
up.    We  can  rule  out  the  first  problem  by  restricting  the  fact  to  cases  where  the  box  does  not 


-15- 

move  downward,  and  nile  out  the  second  by  insisting  that  the  box  have  a  "dead  zone"  where  the 
walls  offer  no  support.  We  can  formalize  this  concept  in  the  following  way.  First,  given  a  body  B, 
and  a  point  q  on  the  surface  of  B ,  we  say  that  q  faces  upward  in  5  if  there  is  some  neighborhood 
of  q  in  the  horizontal  plane  through  q  which  does  not  overlap  with  the  interior  of  B.  Now,  given  a 
box  B  with  opening  P,  we  define  D,  which  is  a  range  of  heights,  as  a  dead  zone  of  <BJ>>  if  P  is 
entirely  above  D  and  no  part  of  the  inner  surface  of  B  with  height  in  D  has  any  point  which  faces 
upward  in  B.  We  can  state  the  following  theorem: 

Let  3  be  a  box  with  opening  P.  Let  L  be  a  pseudo-object  of  B  which  is  a  line  segment  in 
space.  Let  0  be  a  object.  Let  H  be  a  history  with  the  following  properties: 

1)  H  observes  the  gravity  constraint. 

2)  O  is  inside  <Bf>  and  below  L  in  the  starting  scene  of  H. 

3)  The  vertical  projection  of  1  is  a  dead  zone  of  <Bf>  in  every  scene  of  H. 

4)  The  vertical  projection  of  L  is  always  greater  than  the  diameter  of  0. 

5)  The  lower  end  of  L  never  moves  down  in  the  course  of  H. 

6)  No  object  besides  O  is  ever  ir^ide  <Bf>. 

Then  it  foDows  that  O  is  alv  ays  inside  <BJ*>  during  H. 

The  proof  is  simple.  By  continuity,  O  must  pass  through  the  vertical  projection  of  D.  Since 
the  diameter  of  O  is  less  than  the  thickness  of  the  dead  zone,  0  must  be  at  some  point  entirely 
contained  within  the  dead  zone.  When  O  is  entirely  within  the  dead  zone,  it  cannot  have  any  sup- 
port. 

The  important  predicates  for  applying  the  theory  of  open  boxes  to  individual  objects  are  "fits 
inside",  already  described,  "fits  through",  "fits  into",  and  "caps".  Object  0  fits  through  opening  P 
from  face  Fx  to  F2  if  there  is  a  history  in  which  0  goes  through  P  from  Fx  to  F2.  Object  0  fits 
into  open  box  <BJ'>  iff  there  is  a  history  in  hardobjs  in  which  0  goes  from  outside  <BJ3>  to 
inside  <Bf>.  Pseudo-object  P  caps  object  B  if  P(Jfl  *  a  dosed  box. 
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Typical  heuristics  for  computing  these  predicates  are  "If  O  fits  into  <B,P>,  and  Q  is  a  sub- 
set of  O  then  Q  fits  into  <BJ'>";  "If  O  fits  through  P  from  F  to  F2  and  Q  contains  P  and  G 
contains  F  and  G2  contains  F2  then  0  fits  through  Q  from  G.  to  G2";  "If  0  has  a  planar  projec- 
tion H,  and  P  is  a  right  cylinder  with  faces  F-  and  F:  and  H  fits  into  F-  (in  two  dimensions)  then 
O  fits  through  P  from  F-  to  F2."  The  "caps"  predicate  requires  somewhat  different  methods; 
shrinking  or  expanding  either  object  generally  destroys  the  relationship.  Therefore,  this  relation  is 
generally  made  part  of  the  definition  of  the  pseudo-object.  The  problem  then  becomes  an  existen- 
tial one  of  showing  that  such  a  body  exists. 

Also  useful  is  the  specialized  predicate  "h_fits_throiigh(0,/>r/rr/;'2>^')  which  asserts  that 
there  are  histories  in  HH  P|  hardobjs  in  which  O  goes  through  P  from  F-  to  F2.  This  is  mainly 
used  in  the  negative  in  asserting,  for  example,  that  a  particular  button  will  not  go  through  a  hole 
unless  it  is  rotated.  Each  different  kind  of  history  set  here  has  a  different  kind  of  heuristic. 

The  concept  of  an  open  box  can  be  extended  in  a  natural  way  to  the  concept  of  a  box  with 
many  openings.  The  same  facts  apply,  mutatis  mutandis. 

3.3.   Lidded  Boxes 

A  box  with  a  lid  is  a  pair  jf  objects  of  which  one  by  itself  is  an  open  box,  but,  arranged 
properly,  they  form  together  a  closed  box.  Formally,  we  define  a  closed  box  in  terms  of  a  box  5, 
a  lid  L,  the  opening  of  the  box  P,  and  a  scene  5  where  the  lid  is  on  the  box.  These  satisfy  the  fol- 
lowing conditions:  (1)  <BJ>>  is  an  open  box;  (2)  L  does  not  overlap  B  in  5;  (3)  L  is  a  superset  of 
P  in  5;  and  (4)  In  5,  L  does  not  contain  the  entire  component  of  the  inside  of  B  \^JP  that  borders 
P.  We  allow  L  to  leak  out  of  P,  because  we  want  the  conditions  on  L  to  be  weaker  than  those  on 
P.  L  is  allowed  to  have  a  handle,  for  example. 

The  main  feature  of  a  box  with  a  lid  is  that,  as  long  as  the  lid  stays  on,  the  box  functions  as 
a  closed  box.  This  is  a  special  case  of  a  more  general  principle:  namely,  that  a  fixed  connected 
arrangement  of  objects  behaves  like  a  single  object  as  long  as  the  arrangement  is  preserved.  This 
is  actually  a  property  of  the  constraints  we  have  dealt  with.  There  are  constraints  where  it  is  false; 
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for  instance,  the  constraint  that  the  robot  must  touch  every  other  object  in  the  room  exactly  once. 
We  may  formally  describe  this  property  as  follows.  A  constraint  C  is  joinable  if,  given  a  history  H 
and  a  set  of  objects  00  such  that  the  relative  positions  of  the  objects  in  00  remains  constant 
throughout  H  and  the  objects  in  00  are  all  connected  during  H,  if  we  form  a  history  H'  by  replac- 
ing the  objects  in  00  by  a  single  object  O' ,  whose  place  in  any  scene  of  H  is  the  union  of  the 
places  of  the  objects  00,  then  H'  is  also  in  C.  (Note  that  the  reverse  is  not  required;  it  is  not 
necessarily  possible  to  break  up  an  object  into  arbitrary  pieces  and  still  remain  within  the  con- 
straint.) It  is  then  a  theorem  that  the  constraints  we  have  considered  --  hardobjs,  push_driven, 
gravity  --  are  joinable,  (making  the  assumption  that  if  00  contains  an  animate  object,  then  O'  is 
to  be  considered  animate.)  The  joinability  property  can  be  used,  under  appropriate  circumstances, 
to  simplify  analysis;  we  first  prove  that  the  objects  in  00  have  fixed  relative  position,  and  then  we 
can  treat  them  as  a  single  object  for  the  rest  of  the  analysis.  In  the  particular  case  of  a  lidded  box, 
if  we  can  prove  that  the  lid  remains  on  the  box  in  closed  position  throughout  a  history,  or  class  of 
histories,  then  all  the  facts  about  closed  boxes  apply. 

The  other  significant  facts  about  lidded  boxes  are  that  there  exist  lidded  boxes  in  which  the 
lid  can  be  taken  off  and  put  on;  aid  that,  if  the  lid  is  completely  removed,  the  box  and  opening 
function  as  an  open  box. 

3.4.  Cages 

A  cage  is  a  generalization  of  a  closed  box.  Cages  are  closed  containers  whose  walls  have 
holes,  such  as  cages,  crates,  chicken  wire  boxes,  pet  carrier  boxes,  and  so  on.  Associated  with  a 
cage  is  an  object  which  goes  inside  the  cage,  and  an  imaginary  shell,  a  pseudo-object  which 
describes  the  limits  of  a  cage.  The  shell  must  be  a  closed  container  which  includes  the  cage  as  a 
subset;  thus,  it  represents  a  filling  in  of  the  holes  of  the  cage.  The  object  must  be  such  that  it  fits 
inside  the  shell  and  such  that  it  cannot  escape  the  cage.  We  formalize  these  conditions  as  follows: 
(Note  that,  for  simplicity,  we  define  a  cage  as  a  predicate  on  objects  and  pseudo-objects  as  con- 
trasted with  our  above  definitions  of  closed  and  open  boxes  as  predicates  on  bodies.) 
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cagciCOf)  iff  /'  C  is  a  cage  for  object  O  with  limits  P  *l 

1)  object(C);  object(O);  C  =  source_of(i>); 

2)  true  (stand_scene,  subset(C,P)); 

3)  box(y); 

4)  fits_inside(0>/,) 

5)  for  all  (//ehardobjs) 

C,0tobjects(//)  implies 
not  [  true(startscene(//),is_inside(0^>))  and 

true(endscene(//),intersect(0, outside^)))  ] 
For  some  purposes,  it  is  useful  to  associate  inner  bounds  with  a  cage.  Intuitively,  the  inner 
bounds  mark  off  the  area  into  which  a  particular  object  cannot  go  from  outside.  This  is  a  signifi- 
cant space  if  the  cage  is  being  used  to  protect  the  insides  from  the  outsides,  as  a  room  with 
screens,  or  a  shark  cage.  Formally,  we  define  inner_cage(C,0TP)  if  conditions  (1)  to  (3)  above  are 
satisfied,  together  with  the  additional  condition 

5')  for  all  (tfthardobjs) 

[  C,0€objects(//)  ]  ut plies 

not  [  true(startsc  ne(//),is_inside(0,/>))  and 

true(endscene(//),intersect(0, outside(/')))  ] 

Two  of  the  major  functional  facts  about  cages  is  explicit  in  the  definition:  an  object  inside  a 
outer  cage  cannot  get  outside;  an  object  outside  an  inner  cage  cannot  get  in.  These  are  analogous 
to  fact  A.3  about  closed  boxes.  Other  important  functional  facts  are  as  follows: 

D.l)  If  <CJ>>  is  an  outer  cage  for  0  and  <C,Q>  is  an  inner  cage  for  U,  and  C  is  equal  to  the 
intersection  between  P  and  Q  and  the  interior  of  Q  contains  P,  then  O  is  protected  from  U 
in  histories  in  which  O  begins  within  P  and  U  begins  outside  of  Q. 

D.2)  An  object  inside  a  cage  can  be  moved  by  pushing  the  cage. 
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D.3)  If  a  box  with  the  shape  of  P  can  be  in  a  particular  way  around  obstacles,  and  <CJP>  is  a 
outer  cage  for  O,  then  C  with  O  inside  may  be  moved  in  the  corresponding  way  around  the 
same  obstacles. 

We  omit  the  straightforward  formalizations. 

Since  "cage"  is,  itself,  a  relational  predicate  on  two  objects,  we  do  not  have  to  introduce  any 
further  geometrical  relations.  A  very  important  geometric  rule  for  dealing  with  cages  is  that  a  box 
with  a  number  of  discrete  holes  is  a  cage  for  any  object  which  fits  in  the  box  and  which  cannot  fit 
through  any  of  the  holes.  This  rule  covers  99%  of  the  outer  cages  one  meets  in  practice.  Other 
important  rules  include  the  following:  "If  <CJy>  is  an  inner  cage  for  0,  and  O  fits  inside  P, 
then  there  is  a  P'  such  that  <CJJ'>  is  an  outer  cage  for  0";  "If  <CJ>>  is  a  outer  cage  for  0 
then,  for  any  O'  which  is  a  superset  of  0  and  fits  inside  P,  there  exists  a  P'  such  that  <Cf>  is 
a  outer  cage  for  0"'\  "If  <Cf>  is  an  inner  or  outer  cage  for  O  and  C"  contains  C  and  is  con- 
tained in  P  then  <C',P>  is  a  cage  for  0." 

like  "box",  the  concept  of  "cage"  can  be  modified,  and  such  concepts  as  open  cages,  cages 
with  multiple  openings,  or  cages  v  "th  lids  can  be  defined.  These  definitions  are  direct  modifica- 
tions of  the  definitions  of  the  con^sponding  types  of  boxes.  For  example,  an  open  outer  cage  for 
an  object  O  is  a  triple  <CJ>,Q>  such  that  C  is  an  object  (the  cage),  and  P  and  Q  are  pseudo- 
objects  (respectively,  the  shell  and  the  opening  of  the  cage);  C  is  a  subset  of  P;  P  JJ  Q  is  a  box;  O 
fits  inside  P{JQ;  and,  observing  the  hard  objects  constraint,  if  0  starts  inside  P\JQ  and  O  never 
intersects  Q  then  O  never  intersects  the  outside  of  P{JQ. 

For  the  most  part,  the  properties  of  boxes  translate  straightforwardly  into  those  of  cages. 
Fact  B.4,  however,  is  somewhat  tricky.  We  may  formulate  the  analogue  in  cages  as  follows.  Let 
<Cf,Q>  be  an  open  cage  for  object  0.  Define  D,  a  vertical  segment,  to  be  a  dead  zone  if  the 
following  facts  are  true: 

1)      Q  is  entirely  above  D; 
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2)  D  is  longer  than  the  maximum  diameter  of  0 

3)  In  any  scene  in  hard_obj_scenes,  if  O  is  entirely  contained  within  D  and  within  P\JQ,  then 
any  point  where  C  abuts  0  does  not  face  upward. 

The  formal  statement  of  B.4  given  on  page  15  can  now  be  asserted,  substituting  "cage"  for 
"box". 

4.   Buttons 

We  next  consider  buttons,  more  briefly  than  boxes.  Real  buttons  attached  to  cloth  with 
thread  are  beyond  the  scope  of  our  theory,  since  they  rely  on  the  flexibility  of  the  cloth  and  the 
thread.  Rather,  we  will  consider  stiff  buttons,  more  like  cufflinks.  A  stiff  button  consists  of  a  but- 
ton attached  at  a  point  to  a  thread,  which  is  is  more  like  a  needle,  and  which  is  attached  in  turn  to 
a  stiff,  flat  cloth.  The  uppercloth  is  likewise  stiff;  the  hole  has  some,  fixed  elliptical  shape. 

Formally,  the  button,  thread,  and  two  cloths  are  all  objects.  Associated  with  the  button  is  a 
pseudo-object  "threadpoint"  denoting  the  point  where  the  thread  attaches.  Associated  with  the 
thread  are  two  points  "buttonpoint"  and  "clothpoint",  denoting  the  two  attachment  points;  these 
are  on  extreme  ends  of  the  thread  in  some  particular  direction,  the  "thread_line*.  Associated  with 
the  lowercloth  is,  again,  a  "thre  dpoint".  Associated  with  the  upper  cloth  is  a  hole  through  the 
cloth. 

The  axiomatization  of  all  this  is  trivial  except  for  "hole_through".  As  with  box  openings, 
there  are  continuous,  topological  deformations  between  cases  which  are  holes  through  object  and 
cases  are  definitely  not,  and  there  does  not  seem  to  be  any  clean  dividing  line  between  the  two.  As 
usual,  we  prefer  to  make  the  definition  broad,  and  simply  add  narrowing  conditions  when  neces- 
sary. The  following  definition  is  purely  topological,  and  seems  to  capture  the  most  essential  quali- 
ties: Body  H  is  a  hole  through  body  B  iff 

1)      H  is  a  simple  solid; 
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2)  B  (~)H  is  homeomorphic  to  a  2-dimensional  band; 

3)  H  lies  in  the  infinite  connected  component  of  the  complement  of  B; 

4)  Let  U  be  the  connected  component  of  the  complement  of  B  which  contains  H.  Then  U-H  is 
connected. 

Condition  (3)  rules  out  the  case  where  B  is  an  inner  tube  and  H  is  a  cross-sectional  slice  on 
the  inside.  Condition  (4)  distinguishes  betweeh  a  hole  through  an  object  and  the  opening  of  an 
open  box. 

We  next  define  a  number  of  predicates  on  scenes  which  describe  various  button  situations.  In 
these  definition  we  will  use  variable  B  to  represent  the  button,  C  for  the  lower  cloth,  T  for  the 
thread,  U  for  the  upper  cloth,  E  for  the  hole  (empty  space)  and  S  for  the  scene.  The  first  predi- 
cate, which  always  holds  in  button  situations,  is  that  the  button  be  attached  to  the  cloth  through 
the  thread.  Formally 

attached_button(fl, T,C,S)  iff 
Sehard_obj_scenes  and  subset({5  ,T,C),  objects(S))  and 
true(5,  threadpoint(5)  =  outtonpoint(r)  and  threadpoint(C)=  clothpoint(r)) 
The  other  predicates  relate  the  button  to  the  hole.  The  button  may  be  open;  it  may  be 
through  the  hole;  or  it  may  be  buttoned.    (These  possibilities  are  neither  exhaustive  nor  mutually 
exclusive.)  A  button  is  open  if  the  button,  cloth,  and  thread  are  completely  separated  from  the 
upper  cloth  and  hole. 

open_button(5 ,T,C,U£,S)  iff 
attached_button(fl ,T,C,S)  and  true  (5,  disjoint(5(jr(JC^:)) 
A  button  is  through  if  the  thread  is  through  the  hole,  and  the  cloth  and  button  are  disjoint 
from  the  hole.  We  define  one  object  being  through  another  statically  analogously  to  the  dynamic 
definition  above  of  one  object  going  through  another.  An  arc  A  is  through  a  face  F  if f  F  can  be 
extended  to  an  infinite  surface  F'  such  that  the  two  ends  of  A  are  on  opposite  sides  of  F'  and  A 
does  not  intersect  F'  -F.  A  body  T  is  through  a  body  E  from  face  F  to  G  of  E  separating  points  p 
and  q  in  T  if  any  arc  from  p  to  q  lying  wholly  in  T  is  through  F  going  inward  and  through  G 
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going  outward. 

The  two  faces  we  need  are  the  open  faces  of  £  in  U;  that  is,  the  two  connected  components 
of  boundary (E—U).  We  now  define  the  button  being  through:  Let  F  and  G  be  the  open  faces  of 
E  in  U.  Then 

through_button(fl,r,C,C,£,5)  iff 

attached_button(fl, T,CJ)  and  disjoint(C,£,5)  and  disjoint(fl,£,5)  and 

is_through(T,£  ,clothpoint(7")  ,buttonpoint(r)  JF  ,G  ,5) 
A  button-opening  history  is  defined  in  the  obvious  way;  it  starts  with  the  button  closed,  ends  with 

it  open,  and  observes  the  constraints. 

button_opening(//,fl, T,C,U£)  iff 
through_button(fl,7\C,t/,£,startscene(//))  and 
open_button(fl,r,C,t/^,endscene(//))  and 

//ehardobjs  and  for  all  (Sescenes(#))  attached_button(5  ,T,CJ) 
We  define  buttonedness  in  buttons  as  relative  to  a  given  set  of  histories,  in  the  same  way  we 
defined  protectedness  above.  A  button  is  buttoned  if  it  cannot  be  opened  staying  within  the  given 
set  of  histories.  For  example,  we  might  be  interested  in  showing  that  the  button  cannot  be  opened 
if  the  upper  cloth  remains  still  a.id  the  button  does  not  undergo  any  rotational  motion;  or  if  the 
only  rotational  motion  is  in  a  plane  parallel  to  the  hole;  or  if  the  threadpoint  of  the  button  does 
not  move  away  from  the  direction  of  the  thread.  For  this  definition,  we  can  get  rid  of  the  depen- 
dence on  how  we  define  the  hole,  and  define  the  relation  purely  in  terms  of  the  physical  objects. 
We  do  this  by  stating  that  the  button  is  buttoned  if  there  is  any  hole  it  cannot  be  opened  through. 

butt oned(fl,7\C,t/, 5 ,////)  iff  /'  HH  is  the  excluded  set  of  histories  *l 
exists  (£)  [through_button(fl,7\C,t/,£,S)  and 

not  exists(//t////)  [button_opening(//,fl,7\C,l/,£)  and  startscene(//)=S  ]  ] 

The  major  properties  of  buttons  are  as  follows: 
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E.l)  If  a  button  can  be  opened  with  respect  to  a  given  hole  then  it  can  be  closed  with  respect  to 

the  same  hole. 
E.2)  If  a  button  may  be  opened  through  a  given  hole  then  it  must  fit  through  the  hole. 

E.3)  If  the  button,  thead,  and  lower  cloth  are  separated  from  the  upper  cloth  by  a  plane,  then  the 
button  is  not  buttoned  in  most  natural  sets  of  histories;  namely,  any  set  of  histories  in  which 
either  the  upper  cloth  or  the  lower  cloth  can  be  translated  in  a  direction  not  crossing  the 
plane  a  distance  greater  than  the  diameter  of  the  button. 

5.   Rings 

A  ring  is  a  body  with  a  hole  through  it,  as  defined  above.  A  related  class  of  bodies  are 
anti-rings  which  have  the  property  that  one  of  the  connected  components  of  their  complement  is  a 
ring.  For  example,  a  closed  box  with  a  pole  on  the  inside  is  an  anti-ring.  An  inner  tube  is  both  a 
ring  and  an  anti-ring. 

The  primary  relation  on  rings  is  interlocking.  The  standard  topological  definition  of  rings 
interlocking  is  that  two  rings  interlock  if  they  contain  two  curves  that  interlock;  and  two  curves 
interlock  if  they  cannot  be  retracted  to  points  without  intersecting.  An  alternative  definition,  more 
useful  for  our  purposes  involves  ae  idea  of  a  "hole  class".  Intuitively,  a  "hole  class"  of  a  ring  O  is 
the  set  of  all  pseudo-object  holes  through  O  corresponding  to  the  same  actual  hole.  Formally,  one 
may  consider  all  the  holes  of  0  as  defining  a  region  in  the  space  of  compact  subsets  of  /?3;  the 
hole  classes  of  0  are  then  the  connected  components  of  this  region.  We  define  a  body  as  inter- 
locking a  ring  if  the  object  intersects  every  hole  in  some  particular  hole  class  of  the  ring. 

The  basic  facts  about  interlocking  are  as  follows: 

F.l)  If  O  interlocks  with  the  ring  R,  then  0  is  either  a  ring  or  an  anti-ring. 

F.2)  If  0  and  R  are  rings  and  0  interlocks  R  then  R  interlocks  0. 

F.3)  Interlocking  cannot  be  broken.  If  0  is  interlocked  with  R  at  the  beginning  of  a  history  which 
observes  the  hard  objects  constraint,  then  O  is  interlocked  with  R  throughout  the  history. 
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F.4)  Interlocked  objects  cannot  be  arbitrarily  far  apart.  In  fact,  if  0  and  R  interlock,  then  the 
convex  hull  of  O  intersects  R ,  and  vice  versa. 

F.5)  If  R  and  O  are  interlocked,  then  it  is  possible  to  move  O  to  an  arbitrary  position  by  pushing 
R  and  then  moving  0  without  moving  R. 

By  the  above  definition,  we  may  show  that  0  does  not  interlock  with  R  by  exhibiting  a  hole 
from  each  hole  class  of  R  which  O  does  not  intersect.  This  is  generally  easy.  To  show  that  two 
objects  do  interlock,  the  following  test  is  useful:  O  interlocks  with  R  iff  there  exist  two  arcs  A  and 
B  with  the  same  starting  and  ending  points,  both  lying  entirely  within  0  and  there  exists  a  hole  H 
of  R  such  that  A  goes  through  H  and  B  does  not  go  through  H. 

The  basic  shape  predicate  with  rings  is  "can_interlock",  which  is  true  of  two  rings  if  there  is 
some  position  in  which  they  do  interlock.  The  easiest  way  to  compute  this  starts  with  circular 
toruses,  and  then  applies  rules  for  subsets. 

G.l)  Two  circular  toruses,  with  inner  radii  R:,  R2  and  cross-sectional  radii  r. ,  r2  can  interlock  iff 
r2<R\  and  rl<R2. 

G.l)  If  O  can  interlock  with  R  thrr  ugh  hole  H  of  R,  and  R'  is  a  subset  of  R  with  a  hole  H'  which 
is  a  superset  of  H,  then  Oca  interlock  with  R' . 

We  may  define  a  broken  ring  analogously  to  a  open  box.  <R,G>  form  a  broken  ring,  with 
ring  R  and  gap  G  iff  /?(JG  is  a  ring,  Rf^G  consists  of  two  simply  connected  surfaces,  and  for 
some  hole  class  of  R\^JG,  every  hole  in  that  class  abuts  both  R  and  G.  (We  can  define  an  anti- 
ring  with  a  gap  similarly.)  A  ring  R'  interlocks  a  broken  ring  <R,G>  if  R'  interlocks  R^JG.  A 
broken  ring  <R' ,G'>  interlocks  <R,G>  if  R'^JG'  interlocks  R[JG.  Two  broken  rings  are  said 
to  "link"  if  there  is  no  history  in  the  hard  objects  constraint  which  takes  them  from  an  interlocked 
situation  to  an  non-interlocked  position.  A  linked  pair  of  broken  rings  is  analogous  to  an  object  in 
a  cage. 

The  important  functional  facts  associated  with  broken  rings  are  as  follows. 
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H.1)  If  /?!  is  a  ring  interlocked  with  the  broken  ring  <R2,G2>,  then,  to  separate  them,  R:  must 

cut  through  G-,.  (This  is  a  different  relation  than  the  "goes  through"  relation  discussed  above. 

We  will  define  it  below.) 
H.2)  If  </?1,G1>  and  <R2,G2>  are  interlocked  broken  rings,  then,  for  any  history  in  which  the 

two  rings  go  from  being  interlocked  to  separated,  either  i?iU^i   cuts  through  G2  or  vice 

versa  or  Gx  overlaps  G2  at  some  instant. 

H.3)  A  broken  ring  0  can  be  suspended  from  a  broken  ring  R  without  falling  if  the  gap  in  R  is  at 
the  top,  and  the  gap  in  O  is  at  the  bottom. 

The  major  new  predicate  here  is  "cutting  through".  We  define  a  closed  curve  C  as  "slicing" 
an  arc  D,  in  the  course  of  a  history,  if  D  may  be  extended  to  an  infinite  (or  closed  curve)  D'  such 
that  C  is  interlocked  with  D'  at  the  beginning  of  the  history,  and  not  interlocked  at  the  end;  and 
the  intersection  between  C  and  D'  always  lies  in  D.  We  further  define  a  closed  curve  C  in  a  ring 
R  as  "going  around"  a  hole  class  HH  of  R  if  there  is  a  hole  HtHH  of  R  and  a  hole  H'  of  C  such 
that  H'  is  a  subset  of  H\JR.  Finally,  we  define  an  arc  A  in  gap  G  as  "bridging"  G  with  respect  to 
the  ring  R  if  the  end  points  of  A  a  e  in  the  two  different  faces  of  G(~\R.  Using  these  predicates, 
we  define  a  ring  R  as  "cutting  th  ough"  a  gap  C  in  a  broken  ring  R'  in  history  H  if  any  closed 
curve  C  in  R  around  a  hole  slices  any  arc  A  bridging  C  with  respect  to  R' . 

cut_through(fl  Ji'  ,G'  Ji)  iff  _ 

R'  =  source_of (C )  and 
exists(Hff) 
[  hole_class(H7/,/?)  and 
for  all  (CA) 

[  closed_curve(C)  and  goes_around(C  ,/?,#?/)  and  bridge_arc(A,G',/?')  and 
J?=source  of(C)  and/?'=source_of(A) 

]  implies  slices(CA^) 

] 

Using  this  predicate,  the  statement  of  facts  H.1  and  H.2  is  straightforward.  The  statements 
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of  H.3  is  quite  analogous  to  B.3. 

Fact  H.3: 

for  all  (R.,GJt2,G2H) 

[tfehard.objs  f)  must.fall  and  ob}ects(H)={RJi1}  and 

broken_ring(#.,G)  and  broken_ring(J?:,G2)and 

interlock  (rt.lJGj^UGi.startscenet//))  and  not  (movesin  (RJi))  and 

for  all  (peplace(G-,startscene(//)),  <7€place(inside(/?:|JG;))  pz>qz  ]  implies 
interlock  (/?.JJG1,R:,UG;,endscene(//)) 

6.  Conclusions. 

The  major  conclusion  from  these  examples  is  encouragement:  our  ontology  is  sufficient  to 
state  most  that  we  want  to  about  the  functioning  of  simple  tools.  When  our  ontology  fails,  it  is 
generally  because  it  supports  only  a  very  primitive  physics,  and  is  therefore  unable  to  handle  prob- 
lems which  require  a  force  analysis.  These  examples  also  suggest  that  the  most  important 
geometric  predicates  for  this  reasoning  are  those  that  relate  to  various  kinds  of  containment,  fit- 
ting through,  and  support.  The  nej.c  step  is  to  develop  a  general  purpose  geometric  representation 
which  makes  these  predicates  eac  /  to  calculate  and  a  method  to  structure  this  knowledge  so  that 
relevant  facts  can  be  easily  retrieved  in  a  given  situation. 
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